NOTES from the Editor



Analyses of research reports are grouped into three clusters in this issue of ISE, plus three additional analyses. The first cluster, METHODS AND MATERIALS, contains three studies. The second cluster, DEVELOPMENT OF OBSERVATIONAL SKILLS IN CHILDREN, contains three studies conducted by the same pair of investigators and analyzed by one reviewer. The third cluster, SCIENCE PROCESSES, contains two studies. Individual studies focus on formative evaluation, concept learning, and the effects of a teaching experience on the attitudes of preservice teachers.

Publishable responses to the analyses and to the grouping of studies are encouraged.



Stanley L. Helgeson 
Editor 



Patricia E. Blosser 
Associate Editor 

Wideen, Marvin. "Comparison of Student Outcomes for Science -- A Process Approach and Traditional Science Teaching for Third, Fourth, Fifth and Sixth Grade Classes: A Product Evaluation." Journal of Research in Science Teaching, 12(1):31-39, 1975.

Descriptors--*Academic Achievement; *Curriculum Evaluation; *Elementary School Science; Elementary Education; Process Education; *Program Evaluation; Science Education; *Science Course Improvement Project; *Student Attitudes

Expanded Abstract and Analysis Prepared Especially for I.S.E. by David P. Butts, The University of Georgia.



Purpose

The purpose of this study was to document the effectiveness of Science -- A Process Approach in contrast with "traditional science" teaching. Effectiveness was defined as including outcome variables of knowledge, process skills, interests, attitudes, and students' view of the classroom. In addition to the curriculum, student characteristics of intelligence, gender, and grade level were also thought to be related to performance on the outcome variables.



Rationale

Results of previous research suggest that outcome measures of science instruction have not been consistently found to be related to the curriculum. The implied contextual model is that summative evaluation should be done to show how student knowledge and attitude outcomes are related to the content of instruction.



Research Design and Procedure

Using a non-equivalent control group design, 531 students from grades 3 to 5 in two rural school districts were pre- and posttested on six measures related to achievement and attitude. The experimental classes had instruction based on Science -- A Process Approach for an unspecified time per week and for an undesignated number of weeks. The control group received "traditional instruction" for a similar unspecified time per week for an unknown duration. Reliability of the instruments is reported. No estimates of the validity of the measures for the variables of this study are given.



Findings 



Students in rural South Dakota (third to sixth grade) who had the experimental treatment showed a greater growth in science knowledge. No differences in interest in science were found related to achievement and no differences in the students' perception of the classroom were found when compared with similar groups in "traditional" classes.



Interpretation 



Interpretations are focused on discussing how similarity in performance on the outcome measures may be indicative of the fact that they are measuring different variables than those originally expected. Uncontrolled factors in the school context are also suggested as possibly influencing the results.



ABSTRACTOR'S ANALYSIS

In his introduction, the author identifies that previous studies have examined several variables which may be predictors of achievement: socioeconomic background, age, curriculum, reading readiness, gender. He also noted that specific student variables of creativity, interest, and reading readiness may also be influencing and influenced by the students' learning. What is missing from this study is an explicit framework on which to hang the previous research or the questions of this study. The inferred framework is:



[Diagram: CURRICULA leading to ACHIEVEMENT and ATTITUDE]



By the end of the study, recognition is given to the fact that "curriculum" is likely too global a variable and should include at least a recognition that the school, the teacher, and the class sociology are key factors in predicting either achievement or attitude.



[Diagram: CURRICULA, TEACHER, and CLASS SOCIOLOGY leading to ACHIEVEMENT and ATTITUDE]



In a valid study, one has confidence that the relationships described in the conclusions are indeed present. In this study, experimental and control groups are identified. The extent to which these were different or the validity of their difference is omitted. In the absence of information about what specifically was done to whom, when, the limitations with which the conclusions can be believed are enormous. If this study is intended to provide the reader with a summative view of how program A compares with program B, the very globalness of the treatment leaves the results so tentative that they are extremely limited in their usefulness. An alternative and stronger design is one that would have documented specific changes in achievement for identified objectives after an explicitly described treatment.



In the absence of explicit objectives for either treatment or control, why one would expect differences in the range of outcome variables is omitted from the manuscript. However, the clarity of the written report is refreshing.



In studies where instruction is being evaluated, the changes instruction is expected to produce under explicitly described circumstances and with what kinds of students must be included. The omission of each of these three descriptors seems to be amply illustrated in the literature. Further studies would provide much more meaningful data and be more useful to decision-makers if they

1) described the specific outcomes of the instructional program which are

2) to be achieved in smaller time periods than an entire year, and

3) under replicable conditions with three identified variables known to influence the impact of instruction, e.g., the teacher, the context, and the student.

Expanded Abstract and Analysis Prepared Especially for I.S.E. by Frances Lawrenz.



Purpose

The purpose of this article was to examine the relationship between elementary science teacher variables and science teaching practices. The investigation proceeded in three steps. First the authors identified the relationships, then they hypothesized a model to describe and explain them and, finally, they tested the model.

The data employed in the study were from the elementary teacher segment of the data from a national survey of science teaching practices conducted by the Faculty of Science and Mathematics Education in cooperation with the Science, Mathematics, and Environmental Education ERIC Clearinghouse at the Ohio State University.

Rationale

The article provides no discussion of the theoretical basis for the investigation although the underlying assumptions must have been that there were indeed relationships between science teacher variables and teaching practices and that a path model analysis of teacher questionnaire data was a good vehicle for examining these relationships. Also, implicit in the study was the hope that the formulation of a model of the relationships would have some value, i.e., that it would contribute to a more accurate conceptualization of the interaction between teachers and teaching.



A sample of 2,948 schools was drawn from a population consisting of all the public elementary schools in the Mideast, New England, and Southwest regions of the United States during the 1970-71 school year. From these, 1,444 schools responded and usable data were received from [number illegible]. Because initial analyses indicated no difference between the three sampled regions, teacher questionnaire data from one region, the Mideast, were selected for this investigation. Through factor analysis, the 72 teacher-questionnaire items that discriminated among respondents were reduced to eight factors: Factor 1, the Availability of Supply and Equipment Budget; Factor 2, Teaching Experience; Factor 3, Use of Science Course Improvement Projects; Factor 4, Time Allotment for Science; Factor 5, Use of Alternatives to Standard Instructional Programs; Factor 6, Use of Audiovisual Aids; Factor 7, Science Teaching Knowledge; Factor 8, Science Inservice Participation. These eight factors were then used as the variables for forming a path analysis model.



A total of eight models were constructed. The first was built using the intrafactor correlations from the factor analysis for the Mideast region. This initial model graphically presented the eight factors with lines joining those with correlations significant at p ≤ .05. Causal direction was hypothesized and arrows indicating this were included in the diagram. For example, it was hypothesized that Factor 1 (the availability of supplies and equipment budget) caused Factor 8 (Science Inservice Participation).

Once the causal direction of the relationship between factors had been hypothesized, the formulation of a path analysis model using path regression coefficients was possible. These path regression coefficients were determined following Blalock's equation and represented the effects of one variable on another with the effects of the other variables held constant. The first path model was drawn using regression coefficients obtained from the Mideast data. The eight factors were presented in a diagram with directed lines connecting those with significant (p < .05) regression coefficients. Two more models were then constructed: one using the data for the New England region and one using the data for the Southwest region. Finally, to test the reliability of the method of model development, four more models were constructed by dividing the data in the Mideast region in half by two methods and forming a model for each. The two methods of splitting were odd and even numbered cases and every other pair of cases.
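
The idea of a path coefficient as the effect of one factor on another with a third held constant can be illustrated with a minimal sketch in Python. It assumes the coefficients are standardized partial regression coefficients computed from correlations, which is a common textbook formulation and not necessarily the form of Blalock's equation used by the authors. The correlation values and the function name `path_coefficients` are hypothetical and are not data from the study.

```python
def path_coefficients(r_y1, r_y2, r_12):
    """Standardized partial regression coefficients of y on x1 and x2.

    r_y1, r_y2: correlations of the outcome with each predictor.
    r_12: correlation between the two predictors.
    """
    denom = 1.0 - r_12 ** 2
    if denom == 0:
        raise ValueError("predictors are perfectly correlated")
    b1 = (r_y1 - r_y2 * r_12) / denom  # effect of x1 with x2 held constant
    b2 = (r_y2 - r_y1 * r_12) / denom  # effect of x2 with x1 held constant
    return b1, b2


# Hypothetical correlations (invented for illustration only).
r_18 = 0.40  # Factor 1 (budget) with Factor 8 (inservice participation)
r_13 = 0.20  # Factor 1 (budget) with Factor 3 (course improvement projects)
r_83 = 0.50  # Factor 8 (inservice) with Factor 3 (course improvement projects)

# Factor 8 has a single hypothesized cause, so its path coefficient
# equals the simple correlation.
p_81 = r_18

# Factor 3 is regressed on Factor 1 and Factor 8 together.
p_31, p_38 = path_coefficients(r_13, r_83, r_18)

# Indirect effect of Factor 1 on Factor 3 through Factor 8.
indirect = p_81 * p_38

print(round(p_31, 3), round(p_38, 3), round(indirect, 3))
```

In this invented case the correlation between Factor 1 and Factor 3 is carried entirely by the indirect path through Factor 8, which is the kind of mediated relationship the path diagrams were meant to display.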



Findings 

Because each of the seven path analysis models showed somewhat different relationships among the factors, it was concluded that the relationships between the eight factors vary for the given population and that the models fluctuate with the sample. The source of these differences was explained as being due to sampling or regional differences or both. However, these results are confounded and could not be separated. In spite of the differences among the models, they all showed a relationship between Availability of Supply and Equipment Budget (Factor 1) and Science Inservice Participation (Factor 8). In three of the seven path models, Factor 1 was connected to Factor 6 (Use of AV Aids) through Factor 8 and in four of the models Factor 1 was connected to Factor 3 (Use of Science Course Improvement Projects) through Factor 8.



Interpretations

The authors concluded that the availability of a budget for supplies and equipment may have a directional effect through science inservice participation on the use of science course improvement projects, audiovisual aids and/or alternatives to standard instructional programs. In other words, an adequate budget may not be sufficient motivation to get teachers to use innovative teaching techniques; inservice training may also be necessary.

Three implications were suggested from the results of this study:




1. Elementary science teachers need to have budgeted amounts for supplies and equipment.

2. Inservice and preservice teacher-education programs available to elementary science teachers should acquaint them with the materials and help them develop the skills necessary for good science teaching, i.e.:

a. Use of Audiovisual Aids (Factor 6)

b. Use of Alternatives to Standard Instructional Programs (Factor 5)

c. Use of Science Course Improvement Projects (Factor 3)

3. Inservice and preservice teacher-education programs should provide 
guidance as to how teachers can influence budgeting for supplies and 
equipment in their school system. 



ABSTRACTOR'S ANALYSIS 

The path analysis model utilized by Nelson and White is an interesting way of examining the traditional question of the relationship between teacher variables and teaching practices. It offers some unique possibilities. Since path analysis is an uncommon technique in science education research, it may have been useful if the authors had included a discussion of the methodology involved and its unique advantages for the investigation of the teacher/teaching relationship. Generally, path analysis is a method for producing a schematic diagram of related variables. Formulation of a model provides a specific link between an a priori theoretical notion of causal connections and quantified estimates of causal impact. The method presents the same type of information provided by multiple regression or correlation techniques but with a causal component. The advantages of path analysis are its graphic nature of presentation, the opportunity to hypothesize causation, and the demonstration of mediating variables. Its limitations are the less than straightforward nature of the results and the problem of hypothesizing causal direction. This problem is particularly acute when no time constraints or empirically determined information is available to concretely identify causation. As in this study, for example, it may be just as likely that teacher Science Inservice Participation (Factor 8) would cause an increase in the Availability of the Supply and Equipment Budget (Factor 1) as the other way around.

While the research design is sound, there are a few instances where more information would have been helpful. No doubt these gaps could be filled in by reading Nelson's thesis but it would be more convenient if they had been included in the article. The authors do not report how the sample was drawn, how the data were collected or, since the data were analyzed by region, how many respondents were in each region. There is no mention of a nonrespondent study nor of what might have made over half of the collected data unusable. Also, the comparative analyses between the regions are not described nor is the method of factor analysis.

The interpretation of the results is somewhat confusing. Perhaps this is due to the lack of any discussion leading to the formulation of the research plan. It would have been beneficial to see how the present study fit into the general context of investigative work on teacher variables. Such an in-depth analysis would also have helped to support the causal directions applied to the model. In this same vein it would have been instructive if the authors had devoted some time to a discussion of the conditions in the elementary science classroom. More attention to other research or a theoretical basis would also have helped to flesh out the conclusions and implications sections of the article.

It was difficult to understand why eight models were necessary. If the regions were known to be the same, why should a model be formed for each? Why were two sets of halves formed? Possibly the generation of several models was an attempt to empirically verify the initial model. Usually, however, in path analysis a theoretical model is proposed and then data are obtained to test the validity of the model; the theoretical relationships are quantified. In this study the authors seem to pay little attention to the extent of the relationships indicated by the models and concentrate instead on the number of times the relationship appears.

In summary, it seems that the article provides important and relevant data but that it is not presented in the context of previous research. Also, although the method of analysis employed in the study offers interesting possibilities for future research, it is not described in detail.

Daugs, D. R. "Influence of Multilevel Science Materials on Achievement of Sixth Grade Students." Journal of Research in Science Teaching, 10(2):147-152, 1973.

Descriptors--*Academic Achievement; *Earth Science; Educational Research; Elementary School Science; *Instructional Materials; *Media Selection; Reading Difficulty; *Reading Level; Science Education

Expanded Abstract and Analysis Prepared Especially for I.S.E. by Eugene Chiappetta, University of Houston.



Purpose 

The study was designed to investigate the hypothesis that students will make greater gains in achievement if they receive instructional materials at their reading level than if they receive instructional materials that are above their reading level. To elucidate this hypothesis the following questions were researched: (1) Is there a significant difference in achievement on criterion test items between classes of sixth grade students who study material written at one level of difficulty and classes of sixth graders who study material written at five levels of difficulty when students are placed at their respective reading levels by an informal reading inventory? and (2) Is there a significant difference in achievement on criterion test items between classes of sixth grade students who study material written at one level of difficulty and classes of sixth grade students who study materials written at five reading levels when students are placed at their respective reading levels by an informal reading inventory when classrooms are statistically equated with respect to teacher rating?



Rationale 

Daugs was investigating the importance of matching the science text materials to the reading level of the intended student. He analyzed Fryback's work which dealt with the use of multilevel reading materials by placing fifth grade students at one of five reading levels in SRA's "The Earth's Atmosphere." Fryback reported no significant difference in gain between classrooms using five levels or using one level of material. At first Daugs questioned the use of the Standardized Metropolitan Reading Test as a means of placing students at their appropriate reading levels. However, after investigating the results of assigning students to their respective reading levels using an informal reading inventory, Daugs hypothesized that the teacher variable was masking the effects of matching text material to student reading level. Daugs reasoned that in the classrooms where all students were exposed to the same reading level materials, the teacher assisted students who were disadvantaged by the materials above their level.



Research Design and Procedure

Fifteen classrooms of sixth grade students participated in the study. Eight classrooms were assigned to the experimental treatment using multilevel materials and seven classrooms were assigned to a control group using one level of material. A total of 368 pupils participated in the study. The pupils were assigned to reading levels using an informal reading level inventory based on the science materials actually used in their classrooms (SRA's "The Earth's Atmosphere"). The Metropolitan Reading Test was used to equate the two treatment groups on general reading ability.

The experimental design was a pretest-posttest control group design described by Campbell and Stanley. The design is designated:

    R1   O1   X   O2
    R2   O3       O4

Where R1 and X represent the eight sixth grade classrooms using the SRA science reading materials written at five levels of difficulty.

R2 represents the seven sixth grade classrooms using science reading materials written at one reading level. The Metropolitan Reading Test was used as a measure of general reading ability and the scores from this test were used to equate the two treatment groups.

The Cloze readability procedure was employed as the criterion test for the pretest and posttest measures. The Cloze procedure is generally used to measure reading comprehension and can be used to assess subject matter achievement, as was the case in the present study. With the Cloze procedure every fifth or seventh word is deleted from textual material and the student is expected to fill in the missing word, thus demonstrating his knowledge of the material.
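
As an illustration only, the deletion rule described above can be sketched in Python. The passage, the blank marker, the function names `make_cloze` and `score_cloze`, and the exact-word scoring rule are all assumptions made for this sketch; the report does not say how responses were scored or which passages were used.

```python
import string


def make_cloze(text, n=5, blank="_____"):
    """Replace every n-th word with a blank.

    Returns the cloze passage and the list of deleted words in order.
    """
    cloze_words, answers = [], []
    for position, word in enumerate(text.split(), start=1):
        if position % n == 0:
            answers.append(word)
            cloze_words.append(blank)
        else:
            cloze_words.append(word)
    return " ".join(cloze_words), answers


def score_cloze(answers, responses):
    """Count responses that match the deleted word exactly.

    Matching ignores case and surrounding punctuation (an assumed rule).
    """
    def normalize(word):
        return word.strip(string.punctuation).lower()

    return sum(normalize(a) == normalize(r) for a, r in zip(answers, responses))


# Invented passage, not taken from the SRA materials.
passage = ("Air is a mixture of gases that surrounds the earth "
           "and presses on everything")
cloze_text, answers = make_cloze(passage, n=5)

print(cloze_text)
print(score_cloze(answers, ["of", "Earth"]))
```

Passing `n=7` gives the seventh-word variant mentioned in the text. A gain score of the kind analyzed in the study would be the posttest count minus the pretest count for each pupil.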

The teachers' ability to manage reading instruction was believed to be a variable contributing to the effectiveness of the reading program. For this reason, the teachers were rated on criteria in 13 subcategories as to their effectiveness in managing the treatment reading programs. The ratings were made by an observer visiting the classrooms during the science reading instruction. The mean teacher rating scores were used to equate the classrooms statistically by means of the analysis of covariance procedure, with teacher rating as the covariate.



Findings 

Analysis of variance procedures revealed no significant difference between experimental and control groups on either the subtest gains or on total gain scores. There was a significant difference between teacher management ratings of the experimental and the control groups. The teachers in the experimental treatment who managed five levels of reading material received the lowest ratings.



Interpretations

The results of this study indicate that matching pupils' reading levels by employing five levels of science textual materials is no more effective than using one reading level. This result confirms earlier research findings. Another result of the study was that the teachers who employed the multilevel reading program were rated lower on their ability to manage this instructional procedure than the teachers who employed a single level reading program. However, the apparent variation in teacher effectiveness had no effect on the treatment.



ABSTRACTOR'S ANALYSIS

Upon reviewing the present study one is taken aback by the fact that the multilevel approach to science instruction was no more effective than the single reading level approach. Initially one would expect the research to substantiate the utility of the multilevel approach. This expectation is built upon several factors. First, on intuitive grounds the multilevel approach appears superior to the single level approach. Second, the present study is a refinement and an extension of a previous study performed by Daugs and other researchers cited by Daugs. Third, the experimental pretest-posttest research design is a good design and employed well by Daugs. Given the above, why was there not a significant difference in the treatments?

There are at least two possible reasons for treatments producing similar results. The first has to do with the duration of the treatments. The research report does not indicate how long the students participated in the reading instruction. If the reading instruction was relatively short in duration, it is conceivable that the enhancing effects of the multilevel approach would not be demonstrated. If, however, the reading instruction for both treatments was long in duration, say several months for several hours each week, then the advantage of the multilevel treatment might be evidenced.



Another reason why the multilevel treatment did not live up to expectation might be that many other non-reading activities were contributing to the achievement displayed by both groups. Presumably there were laboratory activities associated with the SRA science reading lessons which benefitted both groups. There were probably classroom discussions related to the science activities that pupils in both groups participated in. In addition, there occur numerous activities in school, inside or outside of language arts classes, which pertain to language arts that would influence the type of achievement measured by the Cloze procedure. Hence, if other activities were dominant to the science reading experiences, the effects of the multilevel approach would be masked.




The present line of research should be continued. It is important to the profession to better understand the relationship between reading materials and science achievement. Subsequent studies should employ a reading treatment that is long in duration and that is dominant to other activities that might impart similar knowledge. In this manner the true effects of the reading treatment(s) can be ascertained.

Barufaldi, James P., and Maureen A. Dietz. "The Performance of Children on Visual Observation and Comparison Tasks." Science Education, 59(2):199-205, 1975.

Descriptors--*Achievement; *Educational Research; Elementary Education; *Elementary School Students; Learning Activities; Science Education; *Visual Learning; *Visual Perception; Visual Stimuli

Expanded Abstract and Analysis Prepared Especially for I.S.E. by William S. La Shier, Jr., University of Kansas.

Purpose

The following questions were investigated in this study:

1. Does the use of different types of visual stimuli (solid objects, photographs of the objects, and drawings of the objects) result in different observation and comparison task scores among groups representing the same or different grade levels?

2. Are there differences between the mean scores of boys and girls on each of the four observation and comparison task areas of color, size, form and form-detail?



Rationale

The development of observation and comparison skills was cited as a major objective of many of the elementary science programs. These skills require that students collect information about the physical attributes of an object and recognize similarities and differences among these attributes chiefly through the sense of sight.

Research by Stevenson and McBee (1) indicated that young children discriminated size more efficiently when using three-dimensional objects than when presented with pattern or slightly raised objects. Dornbush and Winnick (2) found that young children learned to discriminate between two solids, cube and parallelepiped, faster than between a square and rectangle. These and related studies suggested that the performance of children on tasks of observing and comparing when presented with solid objects may differ from that demonstrated when they are presented with a two-dimensional representation of the object. Thus, the results of this study may provide the science educator with direction in the choice and nature of instructional materials used to teach process skills.



Research Design and Procedure

Three treatment groups of elementary school students were formed to respond to a set of 14 visual observation and comparison tasks which were administered orally. Two solid objects, a cube and a cylinder, served as a visual stimulus for the first treatment group as it responded to the 14 tasks. The second treatment group had photographs of the cube and cylinder to refer to in responding to the questions. The third treatment group had drawings of the objects as their frame of reference in responding to the 14 tasks. Three tasks focused upon the physical attribute of color, two dealt with size, five tasks focused upon form and four tasks dealt with form-detail. A score of 1 was assigned to each correct response to a task and a zero to incorrect responses. A total score and four subscores were calculated.

Two hundred forty randomly selected students from grades one, two, four, and six were used in this study. The subjects were from two urban elementary schools with a racial composition of 99 percent Black and 1 percent Oriental and others. The overall measure of task reliability, Kuder-Richardson No. 20, yielded a value of 0.664.
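
Kuder-Richardson Formula 20 applies to items scored 1 or 0, as the 14 tasks were. A minimal sketch of the computation in Python follows. The response matrix is invented for illustration, the function name `kr20` is hypothetical, and the use of the population variance (dividing by the number of students) is an assumption, since the article does not say which variance was used.

```python
def kr20(scores):
    """Kuder-Richardson Formula 20 for dichotomous (0/1) item scores.

    scores: one list per student, each holding that student's item scores.
    KR-20 = k/(k-1) * (1 - sum(p*q) / variance of total scores)
    """
    n_students = len(scores)
    k_items = len(scores[0])

    totals = [sum(row) for row in scores]
    mean_total = sum(totals) / n_students
    # Population variance of total scores (assumed convention).
    var_total = sum((t - mean_total) ** 2 for t in totals) / n_students
    if var_total == 0:
        raise ValueError("total scores have zero variance")

    sum_pq = 0.0
    for item in range(k_items):
        p = sum(row[item] for row in scores) / n_students  # proportion correct
        sum_pq += p * (1 - p)

    return (k_items / (k_items - 1)) * (1 - sum_pq / var_total)


# Invented responses: 4 students by 3 tasks.
responses = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
]
reliability = kr20(responses)
print(round(reliability, 3))
```

With the study's data the same function would take 240 rows of 14 scores each.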

The posttest-only research design called for a series of two-way analyses of variance with three types of visual stimuli as treatments and four grade levels. A separate two-way analysis of variance was performed for each of the four dependent variables of color tasks, size tasks, form tasks, and form-detail tasks. Post-hoc comparisons were performed to locate specific sources of significance indicated by the two-way analyses of variance.

Separate one-way analyses of variance were used to determine if there were any significant differences between male and female responses to each of the four sets of tasks.
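
The one-way analysis of variance used for the male and female comparison can be sketched as follows. This is an illustration under stated assumptions: the scores are invented (color subscores can range from 0 to 3 because there were three color tasks), the function name `one_way_anova` is hypothetical, and the sketch stops at the F ratio, which would then be compared with a tabled critical value for the given degrees of freedom.

```python
def one_way_anova(groups):
    """Return (F, df_between, df_within) for a list of score lists."""
    scores = [x for group in groups for x in group]
    grand_mean = sum(scores) / len(scores)
    means = [sum(group) / len(group) for group in groups]

    # Variation of group means around the grand mean.
    ss_between = sum(len(group) * (mean - grand_mean) ** 2
                     for group, mean in zip(groups, means))
    # Variation of individual scores around their own group mean.
    ss_within = sum((x - mean) ** 2
                    for group, mean in zip(groups, means)
                    for x in group)

    df_between = len(groups) - 1
    df_within = len(scores) - len(groups)
    f_ratio = (ss_between / df_between) / (ss_within / df_within)
    return f_ratio, df_between, df_within


# Invented color-task subscores (0 to 3) for two small groups.
males = [1, 2, 2, 3]
females = [2, 3, 3, 2]
f_ratio, df_between, df_within = one_way_anova([males, females])
print(f_ratio, df_between, df_within)
```

The two-way analyses described above extend the same partitioning of variance to two factors (type of stimulus and grade level) and their interaction.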

Findings

The results of the study indicated that there were no differences among the three treatment groups in terms of the mean number of correct responses to the observation and comparison tasks dealing with color. There were, however, significant differences between first graders and sixth graders to the tasks employing color, with the older children giving more correct answers.



The three treatment groups (solid objects, photographs, drawings) did not differ significantly on the tasks employing the attribute of size. As with the color tasks, a significant difference was noted between age levels with older children giving more correct answers.



The treatment groups differed significantly on the tasks employing the attribute of form. Post-hoc comparisons of treatment effects indicated that the use of solid objects or photographs resulted in more correct student responses than the same tasks employing drawings. Again, significant differences were noted between age levels with sixth graders making more correct responses on the form tasks than the first, second, and fourth graders.

The three treatment groups differed significantly on the tasks employing form-detail. Children made more correct responses on the form-detail task when viewing the photographs in contrast to viewing the drawing of the objects. Significant differences were noted among the grade levels with sixth graders making more correct responses than first, second, and fourth graders.

When the mean numbers of correct answers of males and females were contrasted
for each of the four sets of tasks, only one significant difference was
noted: females made more correct responses on the color tasks than did the
males.
The mean scores were higher on tasks which focused on the attribute of
form when children observed and compared either the solid objects or the
photographs of the objects, in comparison to children using drawings as
the stimuli. Post-hoc comparisons of treatment effects indicated that
children made more correct responses on the form-detail tasks when viewing
the photographs than when they viewed the drawings of the objects.
Mean scores on color or size tasks were not significantly different among
the three treatment groups.

The older the children, the more successful they were in performing the
tasks. The females surpassed the males on the tasks related to color.
The implication is that males in the elementary school may have to be
provided with additional experiences with color observation and comparison
tasks.

The implication of the study focuses on the need for careful selection of
appropriate teaching material and method, with consideration given to the
age and sex of the learner as well as the color, size, form, and form detail
among the instructional materials.



ABSTRACTOR'S ANALYSIS 


This study is related to two other studies, Dietz and Barufaldi (1975) and
Barufaldi and Dietz (1975), which are both critiqued in the present ISE
volume. In the first study, Dietz and Barufaldi (1975) studied the effect
of presenting novel vs. ordinary objects on student observation skill.
The observation skills of youngsters were also tested under the conditions
of comparing two objects vs. presenting one object at a time. The final
activity required students to recognize both similarities and differences
of objects, with the objective of determining whether similarities were
mentioned more than were differences.




The Barufaldi and Dietz (1975) study investigated the differential effect
of using either solid objects or two-dimensional representations of objects
on the subsequent performance of youngsters on visual observation and
comparison tasks. These two previous studies both used basically the
same sample population of youngsters selected from one elementary school.

The present study concisely outlined the effects of different types of
visual stimuli on the performance of children on visual observation and
comparison tasks. The article would have been clearer, however, if a table
of mean scores by grade level had been included.

The authors spoke of the educational implications of the study but provided
only a few concrete suggestions for teachers, such as the need to
provide males in the elementary grades with additional experience with
color observation and comparison tasks.

In a recent descriptive article, Thier (1976) described alternative
approaches for developing visual perception skills through the use of
selected science experiences. Marlene Thier, an elementary school learning
disability teacher, provided some suggestions to enable a student to overcome
visual perception problems through involvement in science activities
in many of the new curricula. She also pointed out that the skills needed
to work with SCIS Mr. O, for example, are related to reading. To read
successfully, a child must be able to discriminate between words that are
similar and words that are different, along with other related skills.

Another instructional approach would be to teach directly for visual
perception. Ritz and Raven (1970) investigated the impact of science
instruction and direct instruction in visual perception on the subsequent
reading readiness of kindergarten children. In their study, three groups
of kindergarten students were used to investigate whether AAAS Science -
A Process Approach or instruction in a portion of a perceptual training
program would enhance the reading readiness attainment of the students.
The third group of youngsters received instruction in both the AAAS and
the Frostig material. None of the three treatments produced significant
differences, and the researchers concluded that science and/or visual
perceptual training can be included in kindergarten programs without
impairing the reading readiness of the youngsters.




The reluctance of Barufaldi and Dietz, in the present study, to generalize
the findings to other groups was justified because the sample population
was composed of 99 percent Black students from a neighborhood in which 56
percent of the contacted families reported incomes of less than $5,000 per
year. The socioeconomic status of the students may have influenced their
performance on the various visual tasks. Lowery and Allen (1970) examined
the performance of upper, middle, and lower socioeconomic groups of first
graders on 35 tasks of resemblance sorting. This term meant grouping
together two objects because they are similar in some way. The nonverbal
test booklet consisted of drawings and figures. The findings indicated
that the low socioeconomic group scored significantly lower on the sorting
tasks than the other two groups. Also, a hierarchical sequence of difficulty
for the sorting task was evidenced.

Expanded Abstract and Analysis Prepared Especially for I.S.E. by William
S. LaShier, Jr., University of Kansas.



Purpose

The problems investigated in this study are related to the following
three questions:

1. Do children demonstrate greater skill in observing an ordinary
   object than in observing a novel object?

2. Do children demonstrate greater skill in observing two objects,
   presented one at a time, than in comparing the two objects?

3. Do children demonstrate greater skill in recognizing differences
   between objects than in recognizing similarities in objects?

Research Design and Procedures

A posttest-only research design was used with 66 randomly selected
students in grades one through six from one school in an eastern city.
The students apparently were from low income families. Eleven first
grade students responded to four tasks presented in an interview situation.
This same procedure was followed using an equal number of students
at grades two through six. The interview data were recorded by a
cassette tape recorder and later analyzed.

In Task 1, the student was handed either a cube or a ball and asked
questions to elicit observational statements. In Task 2, either a
gyroscope or kaleidoscope was handed to the student and observational
statements were again asked for. In Task 3, the student was handed
the two objects previously presented and was asked, "How are they
alike?" The final task was to ask the student, "How are these two
objects different?" The total number of correct responses was
determined for each grade level using a procedure which had been
modified from that of The Inquiry Skills Measures (Nelson, 1971).



Findings

The students were equally skillful in making observations of an
object regardless of whether it was familiar or novel. The results
also indicated that children demonstrated significantly greater
skill in observing than in comparing. There was also a significant
difference among grade levels in the performance of children on
observation and comparison tasks.

When the student responses to the comparison tasks were distinguished
as being either a similarity or a difference response, a significant
difference was noted which favored more student statements based on
differences than those based on similarities. No significant differences
were noted among the grade levels.



Interpretation 


In helping students to develop observational skill, the selection
of either a novel object or a familiar object did not seem to be
crucial at any grade level. The study also revealed that children
at all grade levels made at least twice as many observations as
comparisons. The researchers suggested that perhaps comparison skills
tasks should be stressed more during instruction. When the student
responses related to either similarities or differences, the children
used differences more often. This suggests that children should be
given further training in recognizing similarities in objects,
particularly since similarities are the basis for developing classifying
skills.
ABSTRACTOR'S ANALYSIS

The design of this study was explained well, and the only limiting feature
was that the sample population seemed quite narrow in terms of socioeconomic
status and geographic distribution. The results of Task 3 were
quite intriguing. Why do subjects make more statements based on differences
than on similarities? Could the sequence in which the questions
were posed make a difference?

The present study indicated that the introduction of novel objects did
not result in the students making more observations. In a study by
Gillespie (1970) the use of ordinary objects was recommended.

In Gillespie's study a group of 96 youngsters, divided by age (five, six,
seven, and eight years) and sex into equal groups, were administered tests
of discrimination between leaves, association of leaves with pictures of
leaves, sorting leaves into generic groups, and communication of concepts
of leaf structure. No sex difference or interactions were found. However,
there was a significant age effect on all tests, with older children having
less difficulty than did the five year olds. It was recommended that
children be taught to recognize the more common species of trees through
first-hand experience with leaf material, starting from age eight.

In another study, the concept of student curiosity about science objects
was examined. Peterson and Lowery (1972) unobtrusively assessed the
amount of coordinated sensory-motor activity of 120 school children (ages
5 through 13) directed toward an array of science objects in a waiting
room. The researchers were interested in the amount of curiosity expressed
through motor activity. This curiosity was found to be significantly
related to groups based on racial-ethnic origins, but not to groups based
on age or sex. Black children expressed significantly more curiosity
while waiting than did non-Black children. A significant interaction
effect of age, sex, and racial-ethnic origins was also found. The interaction
effect was due to a rapid increase in curiosity from kindergarten to
sixth grade among Black males and a gradual decrease in curiosity among
non-Black males over the same time period.



Peterson and Lowery also reported that children who exhibited greater
amounts of curiosity usually asked few unsolicited questions. One
wonders whether, in the Dietz and Barufaldi study, the nonsignificant
differences in number of student observation statements when presented
with either novel or ordinary objects might be a result of the constraints
associated with the verbal question format. Perhaps a follow-up
study might deal with the relative amount of curiosity associated with
students observing first a novel object and then a familiar object.
Purpose

The problems investigated in the study were related to the following two
questions:

1. Do the differences in visual perception of solid objects and
   two-dimensional representations of the objects affect the
   performance of children on visual observation and comparison tasks?

2. Do the different types of two-dimensional representations
   (photographs or drawings) affect the performance of children on visual
   observation and comparison tasks?


Research Design and Procedures 

The posttest-only research design contrasted the responses of three groups
of students; each group observed and compared one type of visual stimulus
(solid, photograph, or drawing). That is, one group of students would
work with either a set consisting of a cone and pyramid or the set composed
of a cube and cylinder. The second group of students would work
with photographs of one of the sets of objects. The third group worked
with drawings of the sets of objects. Each of the three groups at each
of four grade levels, 1, 2, 4 and 6, consisted of 17 to 20 students. The
procedure in testing a particular group was to place one of the objects
or representations on the desk of each subject and ask the student to
respond to ten observation tasks. The second object or representation was
then introduced and the student responded to an additional 16 tasks
requiring the comparison of two objects or representations. A score of
one was assigned to each correct answer, with the individual's total score
being the total number of correct responses. The 228 children selected
for the study were from one elementary school in a large eastern city and
were randomly selected from grades one, two, four and six.



Findings 


Comparison of group means through the computation of Bonferroni t-values
indicated that in grade two the students who observed and compared the
solid objects scored significantly lower than did students using the
photographs. In grade four, the groups using either the solid objects or
photographs did significantly better than did the group using the drawings. In
grade six, the group using the solid objects did significantly better than
did either the group using the photographs or the group using the drawings.
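
The Bonferroni procedure named above can be sketched as follows. This is a minimal illustration only, not the authors' computation: the scores are hypothetical, the function names are invented for this sketch, and each pair is tested with its own pooled variance, whereas a Bonferroni t following an analysis of variance would often use the overall error term instead.

```python
import math
from itertools import combinations


def pooled_t(a, b):
    """Two-sample t statistic using a pooled variance estimate."""
    na, nb = len(a), len(b)
    mean_a, mean_b = sum(a) / na, sum(b) / nb
    ss_a = sum((x - mean_a) ** 2 for x in a)
    ss_b = sum((x - mean_b) ** 2 for x in b)
    pooled_var = (ss_a + ss_b) / (na + nb - 2)
    return (mean_a - mean_b) / math.sqrt(pooled_var * (1 / na + 1 / nb))


def bonferroni_alpha(n_groups, family_alpha=0.05):
    """Split the family-wise alpha evenly over all pairwise comparisons."""
    n_comparisons = n_groups * (n_groups - 1) // 2
    return n_comparisons, family_alpha / n_comparisons


# Hypothetical total scores for one grade level (not data from the study).
scores = {
    "solid": [18, 20, 19, 21, 22],
    "photograph": [17, 19, 18, 20, 21],
    "drawing": [14, 15, 13, 16, 17],
}

n_comparisons, alpha_each = bonferroni_alpha(len(scores))
t_values = {
    (g1, g2): pooled_t(scores[g1], scores[g2])
    for g1, g2 in combinations(scores, 2)
}
```

Each t value would then be compared with the critical value for the stricter per-comparison level `alpha_each`, so that the chance of any false positive across the three comparisons stays near the family-wise level.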

Interpretation 

Children in grades four and six were more skillful on observation and
comparison tasks employing solid objects than on those tasks using drawings
of the objects. In addition, fourth graders were more successful when
using photographs than when presented with drawings.

The authors suggested that perhaps educators should design experiences
that would give children greater opportunity to utilize photographs and
drawings in the development of scientific skills.



ABSTRACTOR'S ANALYSIS 
A review of research related to the three articles reviewed in this series
began with an on-line computer search of the ERIC system from 1966 through
December 1977 using key identifiers listed in the Thesaurus of ERIC
Descriptors (1977). The terms elementary school science and elementary
school mathematics were combined, and these represented 5,008 entries
labelled as Set 3. The intersection of Set 3 with the set of entries
related to the term visual stimuli produced five entries for Set 5. The
set relating to visual learning but not Set 5, and the set related to
visual perception but not Set 5, produced 8 articles and 15 articles,
respectively, when intersected with Set 3. Two of the articles found
in this search suggest research directions somewhat related to the
reviewed article.
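
The set logic of such a descriptor search can be sketched with ordinary set operations. This is an illustration only: the entry numbers are hypothetical, the variable names are invented here, and treating "combined" as a union (logical OR) of the two descriptor sets is an assumption, since the search statement itself is not reproduced.

```python
# Hypothetical ERIC entry numbers indexed under each descriptor.
elementary_science = {1, 2, 3, 4, 5, 6}
elementary_math = {5, 6, 7, 8}
visual_stimuli = {2, 3, 9}
visual_learning = {3, 4, 10}

# "Set 3": the two subject descriptors combined (assumed here to mean OR).
set3 = elementary_science | elementary_math

# "Set 5": entries in Set 3 that are also indexed under visual stimuli.
set5 = set3 & visual_stimuli

# Visual learning entries not already in Set 5, restricted to Set 3.
visual_learning_new = (visual_learning - set5) & set3
```

Removing Set 5 before intersecting keeps the later result lists free of entries that were already retrieved under visual stimuli.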

Champagne (1970) reported a high positive correlation between the ability
of kindergarten students to draw plane geometric figures and their ability
to conserve mass in the Piagetian sense. The study was based on the
proposition that the child's attention to misleading visual clues is an
important factor in his lack of ability to conserve mass, and that instruction
in drawing two-dimensional figures should affect ability to conserve
mass.

Walker (1972) reported that no significant differences were observed in
problem solving behavior among three groups of sixth grade students. One
group received visual automated instruction, the second group received
automated auditory instruction, and the third group received no instruction.
This study perhaps suggested an area of research not varied in
the three Barufaldi and Dietz studies. That is, perhaps the manner in
which the tasks are presented could be varied. Another possible dependent
variable in these studies could be time spent on each task, particularly
if the instructions were automated for the different grade levels.

REFERENCES 



Champagne, Audrey Ann Briggs. "An Investigation of the Effectiveness of
   Visual-Motor Experiences in the Development of the Ability to Conserve
   Mass." Doctoral dissertation, University of Pittsburgh, 1970.

Thesaurus of ERIC Descriptors, 7th Edition. New York: Macmillan
   Information, Macmillan Publishing Co., Inc., 1977.

Walker, Mary Ann. "The Comparative Effects of Two Methods of Automated
   Instruction, One Visual and One Auditory, in Teaching Selected Problem
   Solving Behaviors to Two Groups of Sixth Grade Students."
   Doctoral dissertation, St. Louis University, 1972.






SCIENCE PROCESSES






Seymour, Lowell A. and Frank X. Sutman. "Critical Thinking Ability, Open-
   Mindedness, and Knowledge of the Processes of Science of Chemistry and
   Non-Chemistry Students." Journal of Research in Science Teaching,
   10(2):159-163, 1973.

   Descriptors — *Chemistry, Critical Thinking, *Curriculum Evaluation,
   *Educational Objectives, Educational Research, Instructional
The purpose of the study is to provide a summative evaluation of a curriculum
guide developed by the Chemistry Curriculum Committee of the
Philadelphia School District.

Instruction based on this Guide was intended to develop critical thinking
ability, open-mindedness, and knowledge of the processes of science
(p. 159). The evaluation was in terms of these constructs.



Rationale 

Essentially, the rationale for the investigation was that critical thinking
ability, open-mindedness, and knowledge of science processes are important
goals of science instruction. This is not a researchable question, but the
authors do cite several references purported to endorse such goals. The
goals are ones likely to be endorsed by a majority of science educators.



Research Design and Procedure

The design used was a nonequivalent control group design (Design 10,
Campbell and Stanley, 1963). The sample consisted of eleventh grade
students in two schools, drawn from a population of 16 schools using the
guide. Data were presented which suggest that the sample was representative
of the population on several variables.

Eleventh grade chemistry students in the two sample schools comprised the
experimental group. The control group consisted of other eleventh grade
students matched to chemistry students by sex and intelligence.

The number of subjects in the study is not mentioned, but the degrees of
freedom listed for the various F-ratios and t-tests ranged from 142 to
182, suggesting that the number in each group was in the order of 150-200.
The authors are not explicit about the duration of the study. One comment
suggests that it was one semester.
Three tests were administered to both groups as pretests and posttests.
These were:

a) Watson-Glaser Critical Thinking Appraisal - Form YM (WGCTA)

b) Rokeach Dogmatism Scale - Form E (RDS)

c) Wisconsin Inventory of Science Processes (WISP)

Scores from pretests and posttests were used to make two comparisons.
First, the pretest and posttest scores for students who used the guide
(experimental group) were compared. Second, posttest scores for the
experimental and control groups were compared.



Findings 

The experimental group made statistically significant gains from pre- to
posttest on the WGCTA, and the posttest scores of the experimental group
were significantly higher than those of the control group. (No means
are reported. "Significant" as used here refers only to statistical
significance at the 0.05 level of confidence or beyond. One is unable
to judge the educational significance of the differences found, and the
authors make no comment in that regard.)



The change in mean on the RDS from pre- to posttest for the experimental
group was not significant at the 0.05 level, but there was a difference
in posttest means for the experimental and control group. The difference
apparently indicated more open-mindedness on the part of the experimental
group. Since means are not reported, this result is somewhat ambiguous.
The authors report no significant difference in pretest mean for experimental
and control groups, no significant gain by the experimental group,
but a significant difference in posttest means for the two groups.
Whether this is due to an increase in dogmatism by the control group or
small initial differences in favor of the experimental group is not clear.

The WISP results for the experimental group indicate
a decrease in knowledge of the processes of science, but the comparison
of posttest scores for experimental and control group shows no significant
difference, as did the comparison of pretest scores.

The authors conclude that the Guide did not result in an increase in
knowledge of science processes. This appears to be a safe assumption,
in spite of the ambiguous results.



Interpretations 

The authors interpret these results to indicate that use of the Guide
results in an increase in critical thinking ability but not in an increase
in knowledge of science processes. They consider the results pertaining
to open-mindedness to be inconclusive.



ABSTRACTOR'S ANALYSIS

It is encouraging that the authors made a sincere attempt to evaluate
the curriculum change that was implemented. Too often changes are made
with no attempt to determine whether the change has the desired effect.
It would also appear that the variables which were examined in this
evaluation are important ones. Most science educators would consider
critical thinking ability, open-mindedness, and knowledge of the processes
of science to be worthwhile goals of science instruction. It is certainly
appropriate to see how these variables might be affected by the
introduction of the new Guide.

Unfortunately, the constructs examined represent multi-faceted skills
and attitudes which are not well defined. We might say that they represent
concepts of low validity. What one person describes as critical
thinking, another does not; when I describe open-mindedness, I find that
others disagree; there are even differences of opinion about what constitute
the processes of science. This is not an argument against trying
to assess such elusive qualities, but it is a cautionary note concerning
the way one interprets the assessment.

The readers of this report should be aware that critical thinking,
open-mindedness, and science process skills are operationally defined by the
instruments used in this study and that other measures of the same constructs
might produce different results. The constructs are not measured
in any absolute sense.

Although I have not used the RDS or the WISP, I used the WGCTA in my
thesis research and became keenly aware that the instrument only measures
certain aspects of critical thinking. Articles by Wallen (1963), Ennis
(1958), and Rust (1962), which I reviewed in 1965, provide ample reason
to accept with caution any suggestion that the WGCTA tells us all that
we want to know about critical thinking. This is not to say that it
tells us nothing that we want to know.



My point is simply that one needs to look carefully at the tests used
in a study of this kind to be sure that the test is measuring what the
curriculum developers thought they had put into their Guide. If it seems
to be doing that, the test results are worth noting; if it does not,
the results may be disregarded as irrelevant. The same suggestion, of
course, goes for any reader who plans to use the result of the evaluation
to make administrative decisions concerning the use of the Guide that
was developed.



Apart from the inherent limitations of the design employed in this study,
the methodology seems to be appropriate. I see no reason to suggest that
the results are spurious. An attempt was made to check the representativeness
of the sample. The matching techniques that were used to obtain
the control group probably produced as much control as possible under
the experimental conditions. (Random assignment is seldom possible in
school settings. We usually have to make do with intact groups.)



I would have preferred to see the authors use analysis of covariance
(using pretest scores as the covariate) when comparing posttest scores
for the experimental and control groups. Of the three possible approaches
to the comparison (simple comparison of posttest scores, comparison of
gain scores, and analysis of covariance), the covariance analysis appears
to be most appropriate. (For an excellent discussion of this issue,
see Lord (1963).)
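
The adjustment that analysis of covariance makes can be sketched with a small worked example. This is a minimal illustration under stated assumptions, not a reanalysis of the study: the scores are hypothetical, the function name is invented here, and only the adjusted means are computed (no F test).

```python
def mean(values):
    return sum(values) / len(values)


def ancova_adjusted_means(groups):
    """Adjust posttest means for pretest differences between groups.

    groups maps a group name to a list of (pretest, posttest) pairs.
    Returns the pooled within-group slope and the adjusted posttest means.
    """
    all_pre = [pre for pairs in groups.values() for pre, _ in pairs]
    grand_pre_mean = mean(all_pre)

    # Pool the within-group sums of products and squares to get one slope.
    sum_xy = 0.0
    sum_xx = 0.0
    for pairs in groups.values():
        pre_mean = mean([pre for pre, _ in pairs])
        post_mean = mean([post for _, post in pairs])
        sum_xy += sum((pre - pre_mean) * (post - post_mean) for pre, post in pairs)
        sum_xx += sum((pre - pre_mean) ** 2 for pre, _ in pairs)
    slope = sum_xy / sum_xx

    adjusted = {}
    for name, pairs in groups.items():
        pre_mean = mean([pre for pre, _ in pairs])
        post_mean = mean([post for _, post in pairs])
        adjusted[name] = post_mean - slope * (pre_mean - grand_pre_mean)
    return slope, adjusted


# Hypothetical (pretest, posttest) scores; not data from the study.
data = {
    "experimental": [(10, 14), (12, 16), (14, 18)],
    "control": [(8, 10), (10, 12), (12, 14)],
}
slope, adjusted = ancova_adjusted_means(data)
raw_difference = 16 - 12  # difference between unadjusted posttest means
adjusted_difference = adjusted["experimental"] - adjusted["control"]
```

In this made-up case the experimental group started two points higher on the pretest, so the four-point posttest difference shrinks to two points once the pretest advantage is taken into account.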



I would have been more comfortable with the study if the report had been
more complete. Some of the things that would have helped me assess
the validity and importance of the findings are the following:
1) Sample test items which suggest the kind of skills being assessed.
   This would help the reader decide if he is concerned about the
   result obtained. It is possible to dig up the tests and examine
   them, but that takes time and effort. Few of us will do it.

2) Additional information about the experimental conditions would
   help. The person who conducts a study is often aware of conditions
   that might have influenced the result. The reader should
   be told about them. This need not be an elaborate description.
   I am willing to believe an author who tells me that students
   appeared to take a test seriously, but when results suggest that
   this could have been a problem and there is no word from the author,
   I am left to wonder.

3) Tables of means (with maximum possible scores) and standard
   deviations should always be included in a report. There are
   numerous reasons that the means and standard deviations are
   important. In this study a table of means would certainly help
   the reader interpret the ambiguous results for the RDS and WISP
   tests. For example, if the significant difference between experimental
   and control groups on the posttest of the RDS resulted
   from a decrease in score for the control group, it is probably
   because the control group didn't take the test seriously rather
   than because the experimental treatment was effective. The means
   could help one decide.

   Even when there is no doubt that a reported difference is real,
   a look at the means can influence how one uses the result.
   Suppose, for example, that the significant difference in critical
   thinking ability that is reported in this study was a very small
   difference. (A difference of a fraction of a point may be
   statistically significant if the test used is highly reliable and
   the number of subjects in the study is large.) A reader may feel
   that even though the difference cannot be attributed to chance,
   it is still too small to merit the expense and effort required
   to adopt the innovation being reported.
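
The point about small but statistically significant differences can be made concrete with a short calculation. This is a sketch with hypothetical numbers: the means, standard deviation, and group sizes are invented, and the functions assume two groups of equal size with a common standard deviation.

```python
import math


def cohens_d(mean_a, mean_b, pooled_sd):
    """Standardized mean difference (effect size)."""
    return (mean_a - mean_b) / pooled_sd


def t_for_equal_groups(d, n_per_group):
    """t statistic implied by effect size d with two groups of equal size."""
    return d * math.sqrt(n_per_group / 2)


# Hypothetical: a half-point difference on a test with a 5-point SD.
d = cohens_d(50.5, 50.0, 5.0)
t_with_200_each = t_for_equal_groups(d, 200)
t_with_800_each = t_for_equal_groups(d, 800)
```

The same half-point difference gives a t of about 1.0 with 200 students per group and about 2.0 with 800 per group, which would cross the usual 0.05 criterion (roughly 1.96 for large samples) even though the difference itself has not changed.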


Perhaps the journal reviewers are more responsible for deficiencies
in reports than are authors. Having served on the Editorial Board of
JRST for several years, I know that authors are urged to be brief and
that some reviewers see no need to clutter up a report with ANOVA
tables, means, standard deviations, and descriptive information that is
crucial for an accurate understanding of the research reported.

Perhaps authors and reviewers alike need to be reminded that the purpose
of research (and research reports) is to provide new information that
others might use. If we keep this idea foremost in our minds and prepare
reports so as to maximize their value to readers rather than maximize
their value to writers (i.e., produce the maximum number of publications
in the minimum amount of space), we would all benefit more from research
that is done.



In the spirit of making research worth more, the authors could have done
one thing. They could have suggested what it is about the Guide that
they evaluated which seemed to produce the gain in critical thinking
ability. The only people who can benefit from evaluative data which
tell us that a given product is good or bad are those who are considering
using the product (the Guide in this case). If, however, evaluators can
provide some hint about why the product is good or bad, many more of us
can benefit.
Purpose

The purpose of this study was to "develop group test items that would be 
good predictors of a student's success in actually performing the process 
in question in an experimental situation." 



Rationale 

Many elementary science curricula aim to develop process skills, but few
instruments to assess these goals exist. Those that are available (e.g.,
the SAPA Individual Competency Measures, the BSCS Processes of Science
Test, and Tannenbaum's Test of Science Processes) "have restricted themselves
to face validity." The authors attempted to develop test items that exhibit
concurrent validity with children's actual performance. Their thesis
was that "face validity is insufficient for instruments of this type."



Research Design and Procedure 



These researchers chose four integrated processes from the SAPA program:
controlling variables, interpreting data, defining operationally, and
formulating hypotheses. They developed objectively scored test items to
be administered to groups with the goal of measuring these same four
processes. The collection of items, called the Group Test, was subjected
to the "normal procedure of writing and revising with the input of expert
opinion, until the authors were confident that the retained items exhibited
face validity." These 79 items (multiple choice or numerical fill-in) were
supplemented with slides and a taped oral script.

A group of 56 seventh graders who had the SAPA instruction as sixth graders
was administered the Group Test and individually assessed for their
performance on 54 tasks from the SAPA Individual Competency Measures of the four
processes of interest. A comparison of the items or tasks to the respective
processes is listed in Table 1.



Table 1. Distribution of Items from the Group Test
and ICM Tasks by Process Areas

Process                     ICM    Group Test

Controlling Variables        12        18
Interpreting Data            15        25
Formulating Hypotheses       14        18
Defining Operationally       13        18

Total                        54        79



Individual testing for each student on the ICM tasks required two 50-minute
periods. Following the 2-1/2 month ICM testing phase, the entire
sample was administered the Group Test.

Indices of the difficulty and discrimination for each item were computed
on the Group Test. The external criterion for the discrimination
calculations was the student score on the ICM tasks. After deleting items
with discrimination indices below +0.20, the investigators calculated
bivariate correlation coefficients between and among the process subtests
from the Group Test and ICM tasks assessing similar processes.
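
The item analysis described above can be sketched in a few lines of Python.
This is a minimal illustration and not the investigators' procedure: the
report does not say which discrimination index was used, so the
point-biserial correlation with the ICM total is assumed here, and the
function names and the six-student data set are hypothetical.

```python
from statistics import mean, pstdev

def item_difficulty(item_scores):
    """Proportion of students answering the item correctly (0/1 scores)."""
    return mean(item_scores)

def point_biserial(item_scores, criterion_scores):
    """Correlation between a 0/1 item and a continuous external criterion."""
    n = len(item_scores)
    mx, my = mean(item_scores), mean(criterion_scores)
    sx, sy = pstdev(item_scores), pstdev(criterion_scores)
    if sx == 0 or sy == 0:
        return 0.0  # no variation: treat the item as non-discriminating
    cov = sum((x - mx) * (y - my)
              for x, y in zip(item_scores, criterion_scores)) / n
    return cov / (sx * sy)

def retain_items(responses, criterion_scores, cutoff=0.20):
    """Names of items whose discrimination index meets the cutoff."""
    return [name for name, scores in responses.items()
            if point_biserial(scores, criterion_scores) >= cutoff]

# Hypothetical data: six students, three Group Test items, ICM task totals.
icm_totals = [10, 20, 30, 40, 50, 60]
responses = {
    "item_a": [0, 0, 0, 1, 1, 1],  # rises with the criterion
    "item_b": [1, 0, 1, 0, 1, 0],  # weakly negative relation
    "item_c": [1, 1, 1, 0, 0, 0],  # falls as the criterion rises
}
kept = retain_items(responses, icm_totals)
print(kept, item_difficulty(responses["item_a"]))
```

With the +0.20 cutoff only the first hypothetical item is retained; an item
answered correctly mainly by students who score low on the criterion is
dropped even if its difficulty is near 50 percent.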

Findings

The experimenters found that the mean indices of discrimination for each
process subtest of the Group Test ranged from +.17 to +.33. When 22 items
(of the 79-item test) that had discrimination indices below +.20 were
deleted, the mean discrimination indices for the four process subtests
rose dramatically to .28, .35, .39 and .44, as expected. The level of
difficulty of the items in these subtests before and after deletion was
"relatively unchanged and very close to the desired level of 50 percent."

The correlation coefficients between the similar subtests on the Group
Test and ICM were all positive and statistically significant at the 0.0001
level. The coefficients ranged from +0.535 to +0.705. Coefficients
between unlike subtests of the two tests were also positively related,
and all significant at the 0.01 level. These ranged from +0.414 to +0.703.
Similarly, correlation coefficients among the four subtests of the Group
Test and among the four ICM subtests were all positive and significant,
ranging between +0.430 and +0.601 for the ICM and +0.561 to +0.786 for
the Group Test.
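
As a rough sketch of how such a set of coefficients might be assembled, the
following Python computes a correlation for every pairing of a Group Test
subtest with an ICM subtest. The scores are invented for illustration, only
two of the four processes are shown, and the use of the Pearson coefficient
is an assumption, since the report refers only to bivariate correlation
coefficients.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson product-moment correlation of two equal-length score lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

def cross_correlations(group_test, icm):
    """Correlate each Group Test subtest with each ICM subtest."""
    return {(g_name, i_name): pearson(g_scores, i_scores)
            for g_name, g_scores in group_test.items()
            for i_name, i_scores in icm.items()}

# Hypothetical subtest scores for five students.
group_test = {
    "controlling_variables": [5, 7, 9, 11, 13],
    "interpreting_data": [8, 6, 12, 10, 14],
}
icm = {
    "controlling_variables": [4, 5, 7, 8, 10],
    "interpreting_data": [6, 5, 9, 9, 11],
}

table = cross_correlations(group_test, icm)
for (g_name, i_name), r in table.items():
    kind = "like" if g_name == i_name else "unlike"
    print(f"{g_name} x {i_name} ({kind}): r = {r:+.3f}")
```

Entries whose two names match correspond to the "similar subtest"
coefficients; the remaining entries correspond to the "unlike subtest"
coefficients.
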
Interpretations

The authors concluded that it is "possible to produce objective items on
a test which exhibit high correlation with a student's ability to actually
perform the higher mental processes as represented by performance on the
integrated processes."

"The likelihood that the integrated processes are, indeed, not unique is
strongly suggested by the intercorrelations within each test." The authors
speculated that success on the integrated processes may be predicted by a
subset of basic processes, level of cognitive development (a la Piaget),
reading ability or the student's mental ability. They suggested that one
of these as yet unidentified variables may be influencing these strong
interrelationships.

The authors subsequently were to choose some of these items and form a
group process test, complete with normative data. They felt this test
could be used by researchers who wish to assess the process skills of
students in various science programs. They hoped their work would
stimulate research to identify factors which influence the development of
science process skills and to explore the generalizability of these skills
to other content areas.



ABSTRACTOR'S ANALYSIS



The problem investigated by this project is significant to science
education and is stated clearly and succinctly. The review of related
literature was very limited. Several attempts to assess student ability to
perform SAPA process skills have been reported by Beard (1971) and Wallace
(1974). Even more specifically, Walbesser and Carter (197^) investigated
the effect of test format (Individual and Group) on students' performance
of process skills. Analysis of these studies may have been of some help in
the conduct of the study or in the preparation of the report.

The small sample size limits the generalizability of the findings of the
study. The students were only described as being seventh graders who had
studied SAPA in the sixth grade. It is not clear if these students had
SAPA instruction during any or all of their K-5 science experiences. The
ability level of the students similarly was not specified. Nor was there
a brief comment as to the type of school or community(ies) from which these
students were selected. Since most schools have considerable information
on reading and mental abilities of students, this information could have
been easily gathered from appropriate school folders.



The choice of tasks from the ICM seems logical and appropriate. It can be
assumed some of the investigators administered and scored the ICM tasks,
but this was not described in the report. It appears that the Group Test
was administered after all students had been individually administered the
ICM tasks. The potential instructional effect of this lengthy assessment
procedure was not discussed. The range and central tendency of student
scores on these tasks were not included in the report. It would have been
helpful to append an illustrative example of ICM tasks and items from the
Group Test. Before one computes correlation coefficients, it would seem
appropriate to describe some characteristics of the data involved.

The analysis was relevant and appropriate, considering the sample size.
No hypotheses were stated with respect to difficulty levels,
discrimination indices or correlation coefficients. With a non-SAPA
control group, one could compare means on specific processes, etc. The
minimum level for "acceptable" index of discrimination of +0.20 was
arbitrary but appropriate. However, the "desired level of 50 percent"
difficulty for items seems consistent with norm-referenced systems but not
with the expressed intent to produce criterion-referenced items.

The use of an external criterion, performance on the ICM tasks, is an
appropriate procedure. The size of the ICM subtests was relatively small,
from 12 to 15 tasks, so one wonders about the reliability of these scores.
Reliability estimates of subtests and total tests were not reported for
either of the two instruments. It would be interesting to know if there
were any similarities among the deleted items (or among the remaining ones),
in terms of content assessed, reading demands, perceptual demands,
mathematical complexity, etc. If there were a larger sample, one could
explore commonality among items via a factor analysis and similar techniques.

The 57 items retained, called "successful" by the authors, were the basis
of calculation of correlation among the process subtests of the Group Test
and the subtests of the ICM. The results indicating non-uniqueness of the
processes as assessed by both the ICM and Group Test were disappointing.

More sophisticated means of analyzing the data may not have been appropriate
with this sample, but it seems logical to consider using multiple regression
(MR) or canonical correlation techniques. With MR one could determine the
relative contribution from each of the ICM process areas to the four
processes as assessed in the Group Test. This technique would also be most
useful when data from additional variables are available to quantitatively
compare the relative contributions. Canonical correlations are appropriate
when one has a set of predictor variables and a set of criterion variables,
which is true in this study. Outcomes from such analysis would enable
one to determine variance common to the two sets of variables as well as
ways in which the left and right sets of variables are individually
structured. One could obtain information as to the individual contribution
of each variable from each set to the interrelationships among the two sets.

The assessment of process skills is an important area for research by
science educators. There are many uses to which research findings and
measurement tools in this area can be applied. It is hoped that this
research is viewed as a first step of a multi-staged domain. Exploration
of the relative performance of students from various programs and transfer
to other content areas were suggested by the authors. The contributions
of student achievement, psy^^l0^l^ science knowledge, or "interest
in sciences" to student process ability need to be explored. One
could investigate the relative effects of various instructional programs
on students in general, or students with specific aptitudes. The effect
of process ability on success in concurrent and future science courses,
post-secondary plans, etc. could be explored when some prior studies have
been completed. It is hoped that these investigators will continue to
wrestle with these research problems.
Purpose

The study involves formative evaluation of an Audio-Tutorial university
physics course. This evaluation focused on teacher-student and
student-student interaction during the course, and attitudes of students
toward the laboratory and toward the course in general.

The following hypotheses were tested: 

H-1 A higher percentage of A-T students than non A-T students will
respond favorably to statements pertaining to attitudes toward
the course in general.

H-2 A higher percentage of A-T students than non A-T students will
respond favorably to statements pertaining to attitudes toward
the laboratory.

H-3 A higher percentage of A-T students than non A-T students will
respond favorably to statements pertaining to the availability
of personal help and personal contact in the course.

H-4 There will be no significant differences in achievement between
the A-T group and the non A-T group.



Rationale 



The rationale for the study is course improvement. Conventional classes
were assumed deficient in their lack of synchronization between laboratory
and lecture and in their lack of student involvement during large
lectures. Audio-Tutorial lessons were instituted to overcome these
weaknesses and the study was initiated to determine the effectiveness of
the new program.
Research Design and Procedure



Seventy students were randomly selected from a class of 420 and were
instructed using A-T methods for ten weeks. The rest of the class
(N=350) studied in a conventional lecture, recitation, laboratory
format.

Mean SATV and SATM scores for the treatment and control groups were
compared and found to be virtually identical. In addition, a
questionnaire was administered which asked about college major,
courses taken in high school, grades obtained, and grade aspirations
in physics. No differences between the treatment and control group
were found on any of these characteristics.

A 15-item questionnaire was administered to both treatment and control
groups after seven weeks of instruction and the results were analyzed
to test hypotheses 1 through 3. Means on the three examinations
given during the course were compared to test hypothesis 4. The
chi-square statistic was used to test for significant differences in
response to each item on the 15-item questionnaire and ANOVA was used
to test differences in means on the three examinations.
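
For a single questionnaire item, a chi-square comparison of this kind might
look like the following Python sketch. The response counts are hypothetical
(the report gives no raw counts), every student is assumed to have answered
the item, and the uncorrected Pearson statistic is assumed because the
report does not say whether a continuity correction was applied.

```python
from math import erfc, sqrt

def chi_square_2x2(a, b, c, d):
    """Pearson chi-square, no continuity correction, for [[a, b], [c, d]]."""
    n = a + b + c + d
    numerator = n * (a * d - b * c) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator

def p_value_df1(chi2):
    """Upper-tail probability of a chi-square variate with 1 degree of freedom."""
    return erfc(sqrt(chi2 / 2))

# Hypothetical counts for one item; only the group sizes (70 A-T students,
# 350 conventionally taught students) are taken from the study.
at_favorable, at_unfavorable = 56, 14
non_at_favorable, non_at_unfavorable = 175, 175

chi2 = chi_square_2x2(at_favorable, at_unfavorable,
                      non_at_favorable, non_at_unfavorable)
p = p_value_df1(chi2)
print(f"chi-square = {chi2:.2f}, p = {p:.6f}")
```

The same function would be applied once per item, which is why the findings
are reported item by item rather than for the questionnaire as a whole.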



Findings 

There were five items on the questionnaire which pertained to general
attitudes toward the course. On four of the five items, the response
of the A-T students indicated a more favorable attitude than that of
the conventionally taught students (α < .01). Two items provided
information about attitudes toward the laboratory and both produced
responses which indicated more favorable attitude on the part of the
A-T students (α < .001). Three items dealt with personal contact.
All three indicated that A-T students were more convinced than
conventional students that they had sufficient opportunity for contact
with instructors and other students (α < .03). Thus, H-1, H-2, and
H-3 were all accepted.
The results on the achievement tests were not as encouraging. There
were no statistically significant differences in means for any of the
three examinations. However, the author pointed out that there appeared
to be slight differences in favor of the conventionally taught students
on each of the three exams (7 percent higher on Exam I, 6 percent higher
on Exam II, 4 percent higher on Exam III). Even though none of these
differences was large enough to rule out the possibility that it occurred
by chance alone (α = .05), the author considered it to be of some
practical significance and attributed the difference to an emphasis on
problem solving in the conventionally taught course.



Interpretations 



The author concludes that "results of the attitudinal testing indicated
that significantly more A-T than non A-T students responded favorably to
questionnaire items related to attitudes toward the course in general,
toward the laboratory, and toward personal contact in the course." . . .
"overall, the A-T group's achievement was slightly below the non A-T
group on course exams. The differences were not statistically
significant. . ."




ABSTRACTOR'S ANALYSIS



Formative evaluation is done for the benefit of those teaching the course.
According to the author, the exploration of the audio-tutorial format
continued after this study ended and "the course, as presently conducted,
retains a basically audio-tutorial format with self-pacing and
mastery-learning aspects and serves approximately 600-700 students each
semester." The author's statement implies that the course was revised on
the basis of the evaluation reported. If so, this is as it should be.
Formative evaluation is done for pragmatic reasons. The data are to be
used for revision.



Data obtained from formative evaluation are usually of limited value to
"outsiders." Conditions vary from place to place and classroom practice
must be influenced by classroom space, faculty load, scheduling patterns,
budget, and other logistical considerations. It is difficult to report
conditions under which a course was taught in enough detail for a person
at a distant campus to decide the probability of obtaining similar
results from the same practice.

This is not to say that studies of this kind are of no value. If enough 
are reported, general patterns begin to emerge. 

Based on studies that I have seen, it is probable that the more favorable
attitudes expressed by A-T students in this study are representative of
attitudes in general. This, of course, will vary from one situation to
another. A bad A-T program will not elicit more favorable reviews than
a stimulating lecture course but when both are well run, the A-T is
likely to get the higher marks.

The finding of "no significant difference" in achievement reported in
this study is also consistent with results reported for other evaluations
of A-T instruction.

In a recent article, Kulik and Jaksa (1977) report finding 24 studies of
A-T instruction which they considered "experimentally adequate." Of these
24 studies, 9 reported significantly higher final exam scores in A-T
classes, 2 reported superior performance in the conventional classes,
and 13 reported no significant difference in achievement. Thus, there
is strong evidence that, on average, students achieve no less in A-T
classes than in conventional classes and there is weak evidence that
they may achieve more. It should be pointed out, however, that Kulik
and Jaksa do not report the subject matter being taught in the 24 studies
that they reviewed. It is quite possible that A-T instruction works
better for some subjects than for others. Work done in an engineering
course which stressed problem solving skills has suggested that A-T may
not work as well in courses that stress analytical thinking as in
courses that stress information acquisition. This reviewer's
impressions gained from reviewing research on A-T over the past several
years are consistent with this.
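
The tally of the 24 studies can be examined with a simple sign test, as in
the Python sketch below. This is only an illustration and not an analysis
reported by Kulik and Jaksa or in the study under review; it assumes the
13 studies showing no significant difference are set aside as ties, and the
function name is hypothetical.

```python
from math import comb

def sign_test_two_sided(wins, losses):
    """Two-sided sign test p-value, with ties already excluded."""
    n = wins + losses
    k = min(wins, losses)
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)

# Counts taken from the tally of 24 studies described in the text.
favoring_at = 9
favoring_conventional = 2
no_difference = 13

total = favoring_at + favoring_conventional + no_difference
share_no_worse = (favoring_at + no_difference) / total
p_value = sign_test_two_sided(favoring_at, favoring_conventional)
print(f"share with A-T no worse = {share_no_worse:.2f}, "
      f"sign test p = {p_value:.3f}")
```

Under these assumptions 22 of the 24 studies show A-T achievement no lower
than conventional achievement, while the two-sided probability for the 9 to
2 split is about .065, which is in keeping with describing the evidence for
higher achievement as weak.
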
In addition to evidence that A-T courses generally receive favorable
comments from students and that achievement in A-T courses is equal to
or slightly greater than achievement in conventionally taught courses,
there is some evidence that time required for A-T courses is less than
in conventional courses. The data available on this point are meager
since most research on A-T does not report information about time.
However, Kulik and Jaksa (1977) report that four of the five studies
that did provide this information reported substantial reductions in
the amount of time needed for students to complete the course work.
The study being reviewed here does not report information on this point.

This reviewer has repeatedly suggested that evaluation studies which
ask, "Is method A better than method B?" are of limited value. If
students in A-T courses have more favorable attitudes, achieve more,
or take less time to accomplish the course objectives, we need to know
what it is about the A-T approach that leads to these benefits. Knowing
that, we are in a better position to tell instructors at other
institutions what they need to do to obtain similar results.
Unfortunately, most evaluation studies fail to give us this information
and we are left to guess.

It seems likely that the greater flexibility of the A-T format may
account for the more favorable attitudes found in this study. As one
colleague put it, "Students like options." The A-T student certainly
has more options than does the student in the conventional class.
Generally he has freedom to schedule his own time, to sequence
learning activities and perhaps even to decide which units are to be
completed. This is seldom the case in conventional classes.

It seems that the integration of laboratory and didactic material in
the A-T format also leads to more favorable attitudes. In the study
under review it was reported that almost 80 percent of students in
non A-T classes indicated that most lab work was not very informative
considering the time spent on it. In contrast, only 40 percent of the
A-T students felt this way. In addition, 55 percent of the non A-T
students in the study indicated that they had difficulty relating the
lab work to the material from the rest of the course while only 26
percent of the A-T students expressed this feeling. Such differences
could have a marked effect on the attitude of a student toward a course.

An interesting bit of information provided by this formative evaluation
was not discussed by the author, undoubtedly because it did not indicate
a difference between A-T and conventional instruction. It was found
that almost 70 percent of students in both A-T and non A-T classes
indicated that they would like more opportunity to "mess around with
physics apparatus and engage in non-quantitative experimentation." Is what
these students are telling us important? Perhaps our formative
evaluation should be asking whether our rush to develop quantitative
skills so important in the physical sciences has robbed students of
opportunities to make the qualitative observations needed to understand
the point of quantitative work. As science educators we seem to give far
more attention to instructional techniques than to content of instruction.
One hears many heated debates on the relative merits of descriptive and
theoretical topics in a beginning course but there is virtually no hard
(I would settle for slightly gelled) data bearing on the debate. Is
it possible to obtain? I think so. It would certainly be worth some
exploratory research to find out.
Voelker, Alan M. "Elementary School Children's Attainment of the Concepts
of Physical and Chemical Change--A Replication." Journal of Research
in Science Teaching, 12(1):5-14, 1975.

Descriptors--Chemistry; *Concept Formation; *Educational Research;
Elementary Education; Elementary School Science; *Instruction;
*Maturation; Science Education; Teaching Concepts

Expanded Abstract and Analysis Prepared Especially for I.S.E. by Donald
E. Riechard, Emory University.

Purpose

The purpose was to determine the relative effectiveness of two methods
of instruction in teaching the concepts of physical and chemical change.
An associated question was to determine whether maturity of the children
as represented by grade level was related to the children's level of
understanding of the selected concepts.

Rationale 

There are two parts to the rationale developed for this study. The first
deals with the need for research evidence to support judgments about the
placement of science concepts in the schools and how the related
instruction should be organized. The second part makes the case for
replication of studies in science education. This study is a replication
of one previously completed by the author (Voelker, 1968).

Research Design and Procedure

The general research design involved application of two different
treatments to two different groups of subjects from each of three grade
levels. There were no pretests. Subjects were posttested after treatment.

The population for this study consisted of all the children in grades
four, five, and six of a single elementary school in a large midwestern
community. Each grade level had four separate classes. Two classes from
each grade level were randomly selected to participate in the study.
The instructional treatments (treatment-one or treatment-two) were
randomly assigned to the two classes. Thus, six classes were involved
in the study. Following instruction, ten children from each of the six
classes were randomly selected for testing.

The primary difference in the two treatments was in the roles played by
teacher and students. In treatment one (T1), the responsibility for
formulating (discovering) the basic scientific generalization was on the
student. In treatment two (T2), the teacher formulated and stated the
generalization.

Experimental procedures and testing were conducted over four consecutive
weeks. The author describes the testing as a "semi-clinical approach."
Individual children were asked to: 1) answer three verbal questions about
physical and chemical change; 2) classify demonstrated and described
phenomena as examples of either physical or chemical change; and, 3)
justify the classification of each phenomenon.

Analyses of the data were summarized in six tables. Analysis of variance
(ANOVA) was the primary statistic used. A discussion of results is
keyed to the information presented in the tables.

Findings

Answers to Verbal Questions. At the fourth grade level the mean scores
of the T1 group were greater than those of the T2 group for each of the
three questions. At the fifth and sixth grade levels, the mean scores of
the T2 group were greater than or equal to the mean scores of the T1
group for each of the three questions. However, none of the means was
significantly different (alpha = 0.05) for either main effects or
interaction effects.
Classifying Demonstrated and Described Phenomena. Fourth and sixth grade
T1 group mean scores on demonstrated phenomena exceeded those for the
respective grades' T2 group. The mean scores for the T1 group were
greater than or equal to those of the T2 group at all grade levels when
classifying described phenomena. Significant differences (alpha = 0.05)
for the grade level effect were found for both described and demonstrated
phenomena. For demonstrated phenomena, the grade six mean was significantly
greater than the grade five mean whereas the grade six mean was
significantly greater than both the fourth and fifth grade means on
described phenomena (Newman-Keuls, p = 0.05).

An item analysis revealed that eight of the fourteen demonstrated phenomena
were correctly classified by 50 percent or more of the children receiving
T1 or T2 treatments. None of the six described phenomena was classified
correctly 50 percent or more of the time by children receiving either
T1 or T2.



Justifying the Classification of Phenomena. At both the fifth and sixth
grade levels, the mean scores for the T2 group were greater than the mean
scores for the T1 group. There was a significant difference (ANOVA,
p = .04) between the means for the grade level factor only. Although
students had the opportunity to ask questions as a means of gathering
more information about the phenomena, the percent of instances where
questions were asked was less than 1 percent.


Minimum Success Level. Minimum success levels were set for each of the
three parts of the test (questions, classification, justification). Only
the fourth grade T1 group met the minimum criterion for answering the
questions. The fourth grade T2 group met the minimum criterion for
classifying the two types of phenomena. Both sixth grade T1 and T2
groups met the minimum criterion for classifying demonstrated and described
phenomena and supporting correct classification with acceptable responses.
Correlations. Correlations between children's scores on correct
classifications of phenomena and support of correct classifications with
acceptable reasons were significant (p = 0.05) for four of the six grade
level-treatment groups (G4-T1, G5-T2, G6-T1, G6-T2). Correlations between
test scores and standard achievement scores (reading vocabulary and
comprehension, arithmetic computation, spelling, and IQ) were significant
(p = 0.05) for the grade 6-T1 group only.


Interpretations 

The author states that this replication study has added credibility to
the following conclusions from the original investigation:

1. It does not appear appropriate to attempt to teach the concepts
of physical and chemical change prior to grade six if instruction
is to be in large group, teacher directed format, and the
expectation is that children meet minimum criteria for all
objectives set forth.

2. A major inhibitor to concept formation appears to be maturation
as a function of grade level.

3. Children are more able to reveal their understanding of the concepts
of physical change and chemical change through classifying
phenomena rather than formulating definitions or applying
concept definitions.

4. Children can be expected to generate some knowledge for themselves.

5. Children are not learning how to ask questions that will help
them learn.

6. There is a relationship between the ability to classify phenomena
correctly and being able to explain why the respective
classifications were made.

7. The relationship between success in developing those concepts
and achievement on standard tests appears to be more a function
of the classroom in which the student is found rather than a
general factor across age level or treatment group.



ABSTRACTOR'S ANALYSIS 

Relationship to Other Studies. This investigation is related to a large
cluster of studies on concept learning. The original investigation (Voelker,
1968) whose replication is reported here was conducted through the
Wisconsin Research and Development Center for Cognitive Learning at Madison.
Numerous science concept studies came out of that R&D Center at about
the same time. Among them were those by Carey (1968), Helgeson (1968),
Pella and Ziegler (1962), Stauss (1968), and Triezenberg (1968).

It is not feasible here to produce a complete listing of investigations
related to this study and the general area of concept learning in science.
It is noteworthy, however, that published research expressing concern
for children's cognitive development in science can be traced back many
years. Hall's study, for example, dates into the last century (1891).
The work of Piaget and his associates, begun in the 1920's, surely is of
much significance. Examples of some of the more recent studies related
to science concept learning are those reported by Anderson (1965), Haney
(1965), Howe (1974), Lawson & Wollman (1976), Raven (1974), and Riechard
(1973).

Because of the voluminous nature of the concept learning literature,
the reader is urged to enlist the services of ERIC/SMEAC in the search
for specific research related to his or her own study. Other sources
(some available through ERIC/SMEAC) are the various concept learning
bibliographies such as those by Klausmeier, et al. (1969) and Voelker (1973).



Contributions of the Study. The results of this study are generally
consistent with that which is known about cognitive development. Further,
taken with the original investigation, there is good evidence to suggest
the appropriate grade placement for the concepts of physical and chemical
change. It is this abstractor's belief, however, that a major contribution
of the investigation lies in the fact that it is a replication. The author
makes a sound case for replication studies and it is likely that most
science educators agree with him. However, a cursory review of the
literature shows that very few replication studies ever appear (probably
less than five percent of the total research published). Investigators
and those responsible for directing research (e.g., graduate student
research, etc.) should consider the need and potential contribution of
replication studies. It seems that replication of selected works could
make a far more substantive contribution than the continuation of
one-shot, bits-and-pieces research.

Current State of Research in the Field. As suggested above, much research
has been done on concept learning. A good bit of it has been done in
science education. Four basic concerns can be identified relative to that
research. First, different studies are seldom tied together as they are
planned and produced or after the results are published. Some positive
signs can be identified, however. The Wisconsin R&D Center, as mentioned
above, made substantial progress in planning and producing related research
in science concept learning. More efforts of this nature are needed.
And this ERIC publication, Investigations in Science Education, is a
welcome effort toward organizing and examining "clusters" of related
research that have appeared in the literature.


Second, many of the studies have been one-shot research. Most have not
been replicated; some could not be replicated. Studies need to be
replicable and replicated!



Third, much of the research has been done with children seven years of
age or older. Given the psychological research which suggests the importance
of intellectual growth prior to age seven, there is need to study concept
development in the early years.
And finally, even considering the three concerns above, more is known
about concept learning in science than is practiced in the classroom.
Too many teachers, for example, still try to teach science concepts by
the single method of requiring students to read the chapter and write out
the answers to questions at the end. Thus there is need to decrease
the lag time between research and practice.

Comments on the Study. The research design for this study is adequate
and, in general, the written report is well done. Language is precise
and the material presented is organized into sections which flow smoothly
from one to the next. The data appear to be analyzed thoroughly and
appropriately.

As with all reports, however, some questions and concerns can be identified.
Neither the number of students making up the total population nor the number
of subjects in each treatment group is stated, for example. How large was
each class and was each class instructed as a total unit? The number in
each instructional group is of major importance given the reference to
group size in interpretation number one, above.

The liberal use of tables provides an excellent aid in the examination
of results. Tables IV and V are not completely clear at first reading,
however, and require more careful study than the other tables. Further,
the discussion of item analysis implies that Table V deals with both
demonstrated and described phenomena; in fact, it deals only with
demonstrated phenomena. It is also noted that the relationships among the
grade-level scores on the described phenomena are missing from the
footnote in Table II.

Finally, the author appropriately related his conclusions (see section
on interpretations, above) to the original investigation. This should be
done in replication studies. While the results of this study generally
support the conclusions from the original investigation, the replication
data are not without incongruences. For example, the fourth grade groups
frequently outperformed the fifth graders and even the sixth grade groups
on a few measures. Why? The question cannot be answered here. One
possible answer is sampling bias. In this study, the sample was selected
from one school only, where students had been assigned to classes prior
to the investigation. Criteria for selection of the school and assignment
of students to classes were not defined. There were no pretests
to determine student performance prior to instruction. The author made
good use of randomization on those things over which he had control. The
study would have been more generalizable, with less chance of sampling
bias, however, had the population been larger and more diverse, if all
assignments had been random, and if pretests had been used. The reader
should recognize, however, that restraints are often imposed on behavioral
research. The researcher frequently has to work with "what is" rather than
"what ought to be." While some questions about sampling bias in this
investigation can be raised, they do not detract greatly from the overall
merits of the study.

Descriptors--*Attitudes; Educational Research; Higher Education;
Individualized Learning; *Preservice Education; Science Education;
*Science Teachers; *Teacher Education; *Teaching Procedures

Expanded Abstract and Analysis Prepared Especially for I.S.E. by Victor
J. Mayer, The Ohio State University.



Purpose

The purpose of the study was to determine the effects of a teaching
experience upon undergraduate students' attitudes toward teaching and certain
instructional practices. The instructional objectives evaluated were not
listed; however, they can be inferred from the "basic Rationale" for the
experience:

1) to allow the students enrolled to make a rational decision
pertaining to the feasibility of teaching as a career, and

2) to provide first-hand experience of teaching to insure maximum
gain from advanced, professional education coursework to be taken.

Rationale

Not provided.



Research Design and Procedure

The design used was the one group pretest and posttest design. The
instrument used was a semantic differential to assess changes in
attitudes. It had eleven concepts and a scale consisting of ten adjective
pairs, each with seven semantic spaces. A twelfth variable was
determined by summing the scores of the eleven concepts.

The experience evaluated consisted of a one-semester participation in
the daily activities of a public school. It enrolled 21 sophomores
at the University of Iowa.
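
The one group pretest and posttest comparison described above can be illustrated with a short sketch. It assumes the t values reported in the article came from a paired (dependent-samples) t test, which this abstract does not state explicitly. The function name `paired_t` and the five-participant scores are hypothetical and are not data from the study.

```python
import math
import statistics


def paired_t(pre, post):
    """Return (t, df) for a paired t test on pretest and posttest scores."""
    if len(pre) != len(post):
        raise ValueError("pre and post must have the same length")
    # Differences are taken as pretest minus posttest (an assumption,
    # chosen so that a drop in score gives a positive t).
    diffs = [a - b for a, b in zip(pre, post)]
    n = len(diffs)
    mean_d = statistics.mean(diffs)
    sd_d = statistics.stdev(diffs)  # sample standard deviation (n - 1)
    t = mean_d / (sd_d / math.sqrt(n))
    return t, n - 1


# Hypothetical scores on one concept for five participants.
pre = [24, 30, 22, 27, 25]
post = [12, 18, 15, 14, 11]
t, df = paired_t(pre, post)
print(f"t = {t:.2f}, df = {df}")
```

With 21 participants, as in the study, the same function would return df = 20.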



Findings 



A positive change in the pretest and posttest scores, significant at
the .05 level of confidence, was found for 11 of the 12 variables.

TABLE I

Pretest and Posttest Results of Attitude Toward Selected Educational Concepts as
Measured By The Semantic Differential for Exploratory Teaching Participants

Concept                        Pretest Mean   Posttest Mean       t

Individualized Learning            22.67          11.48          6.39*
Being a Science Teacher            25.38          15.38          5.15*
Teaching Secondary Students        25.24          14.81          6.45*
Interaction                        21.05           7.62          7.25*
Content-Oriented Approach          33.57          26.86          3.63*
Classroom Management               35.43          30.10          2.84*
Science Teaching Materials         25.05          14.00          5.52*
Teaching Elementary Students       25.05          12.43          8.09*
Process-Oriented Approach          22.76          11.67          6.69*
Importance of Discipline           31.81          28.57          1.76
Early Exploratory Teaching         23.95          15.57          3.11*
Total Composite Score             293.33         188.76         11.54*

Critical Region = |t| > 2.08
*Significant at the 0.05 level
N = 21, df = 20
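
The significance flags in Table I can be cross-checked against the reported critical value. The sketch below simply compares each reported t with 2.08. The variable names are illustrative, and the t values are copied from the table as printed in this abstract.

```python
# t values as reported in Table I of this abstract.
T_VALUES = {
    "Individualized Learning": 6.39,
    "Being a Science Teacher": 5.15,
    "Teaching Secondary Students": 6.45,
    "Interaction": 7.25,
    "Content-Oriented Approach": 3.63,
    "Classroom Management": 2.84,
    "Science Teaching Materials": 5.52,
    "Teaching Elementary Students": 8.09,
    "Process-Oriented Approach": 6.69,
    "Importance of Discipline": 1.76,
    "Early Exploratory Teaching": 3.11,
    "Total Composite Score": 11.54,
}

# Critical value stated in the table footnote (df = 20, 0.05 level).
CRITICAL_T = 2.08

significant = [name for name, t in T_VALUES.items() if abs(t) > CRITICAL_T]
not_significant = [name for name in T_VALUES if name not in significant]

print(len(significant), "of", len(T_VALUES), "variables exceed the critical value")
print("Not significant:", not_significant)
```

The count agrees with the Findings statement that 11 of the 12 variables showed a significant change.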







Interpretations

The exploratory teaching experience resulted in a positive change in
attitudes.



The following are additional interpretations made by the author: 

The changes are indicative of the development of a more sensitive,
confident, humanistic, and laboratory-oriented prospective
teacher. Furthermore, the objectives of the program were apparently
fulfilled as the data would indicate that the participants
have developed insights into teaching which should allow them to
make more rational decision (sic) as to whether teaching is a
potential career.



ABSTRACTOR'S ANALYSIS 

No rationale is provided for the use of an exploratory teaching experience,
nor is there a summary of appropriate studies. The reader is left to his
own devices to try to determine what led the author to provide such an
experience for his students. There is substantial literature in science
education bearing upon this question.

No rationale is provided as to why an exploratory teaching experience
should affect student attitudes toward teaching. Again there is
substantial literature. I would hope that at least those studies using a
semantic differential, such as the one by James (1970), would be cited.
Only one, however, relating to preservice training is cited in the
introduction. No attempt is made to relate the results of this study to
those of others having a similar purpose.

No discussion is provided concerning the methods used in the development
of the instrument, measures used to insure validity, nor estimates of
reliability. No discussion of the dimensions commonly comprising semantic
differentials (evaluative, potency, activity and understanding) and
found to be relatively stable is provided as an underpinning to the
development of the instrument. It would seem that five of the adjective
pairs are evaluative, two are understanding, one is potency and two do
not seem to fit any of the four dimensions. What was the rationale for
this distribution? Do the four dimensions show different patterns of
response? Was a factor analysis attempted? How were the concepts
selected? The adjective pairs? These questions relate to the validity of
the instrument. The author does state that the concepts selected were
determined by field testing. However, no information is given on the
procedure used.

The interpretations offered by the author are not possible with the
design used. There is a large body of literature indicating very weak,
if any, relationship between attitudes and behavior. Yet the author
concludes that changes in attitudes documented in the study "are indicative
of the development of a more sensitive, confident, humanistic, and
laboratory-oriented prospective teacher," implying that his students
will behave in those ways. The results of the study may demonstrate
the formation of attitudes that would favor the development of sensitive,
confident, humanistic and laboratory-oriented prospective teachers;
however, I doubt that they are indicative of such development.

The article is written in a clear, succinct style. It is easily
understandable. In addition to the problems previously addressed,
however, there is some inappropriate use of terminology. This is not
an experimental study. At best it is an evaluation of a pilot program.
Yet a description of the program is entitled "experimental treatment."
The lead sentence under "Analysis and Findings" states that "the null
hypothesis stated that there was no..., etc." The null hypothesis is
never rejected. Instead, it is later stated that a significant difference
was found for eleven of the twelve variables. One wonders why the
term "null hypothesis" is ever introduced. Perhaps for the same reason
that many investigators use null hypotheses as research hypotheses in
reporting their studies; they seem to lend a certain amount of
sophistication. In a descriptive study such as this they are superfluous, as
the author seems to concede since it is not brought up again after its
introduction. Null hypotheses are statistical devices used in the
analysis of data. Although not used in this study as research hypotheses,
they have been so used in many studies appearing in the science
education literature. Such uses are inappropriate and misleading.